112 research outputs found

    Approximate kernel clustering

    Full text link
    In the kernel clustering problem we are given a large nΓ—nn\times n positive semi-definite matrix A=(aij)A=(a_{ij}) with βˆ‘i,j=1naij=0\sum_{i,j=1}^na_{ij}=0 and a small kΓ—kk\times k positive semi-definite matrix B=(bij)B=(b_{ij}). The goal is to find a partition S1,...,SkS_1,...,S_k of {1,...n}\{1,... n\} which maximizes the quantity βˆ‘i,j=1k(βˆ‘(i,j)∈SiΓ—Sjaij)bij. \sum_{i,j=1}^k (\sum_{(i,j)\in S_i\times S_j}a_{ij})b_{ij}. We study the computational complexity of this generic clustering problem which originates in the theory of machine learning. We design a constant factor polynomial time approximation algorithm for this problem, answering a question posed by Song, Smola, Gretton and Borgwardt. In some cases we manage to compute the sharp approximation threshold for this problem assuming the Unique Games Conjecture (UGC). In particular, when BB is the 3Γ—33\times 3 identity matrix the UGC hardness threshold of this problem is exactly 16Ο€27\frac{16\pi}{27}. We present and study a geometric conjecture of independent interest which we show would imply that the UGC threshold when BB is the kΓ—kk\times k identity matrix is 8Ο€9(1βˆ’1k)\frac{8\pi}{9}(1-\frac{1}{k}) for every kβ‰₯3k\ge 3

    A Characterization of Approximation Resistance for Even kk-Partite CSPs

    Full text link
    A constraint satisfaction problem (CSP) is said to be \emph{approximation resistant} if it is hard to approximate better than the trivial algorithm which picks a uniformly random assignment. Assuming the Unique Games Conjecture, we give a characterization of approximation resistance for kk-partite CSPs defined by an even predicate

    Hardness of Bipartite Expansion

    Get PDF
    We study the natural problem of estimating the expansion of subsets of vertices on one side of a bipartite graph. More precisely, given a bipartite graph G(U,V,E) and a parameter beta, the goal is to find a subset V\u27 subseteq V containing beta fraction of the vertices of V which minimizes the size of N(V\u27), the neighborhood of V\u27. This problem, which we call Bipartite Expansion, is a special case of submodular minimization subject to a cardinality constraint, and is also related to other problems in graph partitioning and expansion. Previous to this work, there was no hardness of approximation known for Bipartite Expansion. In this paper we show the following strong inapproximability for Bipartite Expansion: for any constants tau, gamma > 0 there is no algorithm which, given a constant beta > 0 and a bipartite graph G(U,V,E), runs in polynomial time and decides whether - (YES case) There is a subset S^* subseteq V s.t. |S^*| >= beta*|V| satisfying |N(S^*)| <= gamma |U|, or - (NO case) Any subset S subseteq V s.t. |S| >= tau*beta*|V| satisfies |N(S)| >= (1 - gamma)|U|, unless NP subseteq intersect_{epsilon > 0}{DTIME}(2^{n^epsi;on}) i.e. NP has subexponential time algorithms. We note that our hardness result stated above is a vertex expansion analogue of the Small Set (Edge) Expansion Conjecture of Raghavendra and Steurer 2010

    A New Multilayered PCP and the Hardness of Hypergraph Vertex Cover

    Get PDF
    Given a kk-uniform hyper-graph, the Ekk-Vertex-Cover problem is to find the smallest subset of vertices that intersects every hyper-edge. We present a new multilayered PCP construction that extends the Raz verifier. This enables us to prove that Ekk-Vertex-Cover is NP-hard to approximate within factor (kβˆ’1βˆ’Ο΅)(k-1-\epsilon) for any kβ‰₯3k \geq 3 and any Ο΅>0\epsilon>0. The result is essentially tight as this problem can be easily approximated within factor kk. Our construction makes use of the biased Long-Code and is analyzed using combinatorial properties of ss-wise tt-intersecting families of subsets

    On the hardness of learning intersections of two halfspaces

    Get PDF
    AbstractWe show that unless NP=RP, it is hard to (even) weakly PAC-learn intersection of two halfspaces in Rn using a hypothesis which is a function of up to β„“ halfspaces (linear threshold functions) for any integer β„“. Specifically, we show that for every integer β„“ and an arbitrarily small constant Ξ΅>0, unless NP=RP, no polynomial time algorithm can distinguish whether there is an intersection of two halfspaces that correctly classifies a given set of labeled points in Rn, or whether any function of β„“ halfspaces can correctly classify at most 12+Ξ΅ fraction of the points
    • …
    corecore